Automatic Detection of Antecedents of Japanese Zero Pronouns Using a Japanese-English Bilingual Corpus

Authors

  • Dong Zhan
  • Hiromi Nakaiwa
Abstract

In this paper we present a method of detecting zero pronouns in Japanese clauses and identifying their antecedents using aligned sentence pairs from a Japanese-English bilingual corpus and open resource tools. We use syntactic and semantic structures and the alignment of words and phrases in the sentence pairs to automatically detect zero pronouns and determine their antecedents using English translations. We build rules to link antecedents with zero pronouns and create filters to remove problematic sentence pairs. Experimental results confirm the effectiveness of our method. The proposed method allows the construction of an annotated corpus of zero pronoun sentences in which the antecedents of the missing pronouns are flagged. This would be very useful for machine translation (MT), because zero pronoun detection is a vital problem when translating languages which allow zero pronouns.

Similar resources

Automatic Extraction Of Rules For Anaphora Resolution Of Japanese Zero Pronouns From Aligned Sentence Pairs

This paper proposes a method to extract rules for anaphora resolution of Japanese zero pronouns from aligned sentence pairs. The method focuses on the characteristics of Japanese and English in which both the language families and the distribution of zero pronouns are very different. In this method, zero pronouns in the Japanese sentence and the English translation equivalents of their antecede...

Anaphora Resolution of Japanese Zero Pronouns with Deictic Reference

This paper proposes a method to resolve the reference of deictic Japanese zero pronouns which can be implemented in a practical machine translation system. This method focuses on semantic and pragmatic constraints such as semantic constraints on cases, modal expressions, verbal semantic attributes and conjunctions to determine the deictic reference of Japanese zero pronouns. This method is high...

Automatic Identification of Zero Pronouns and their Antecedents within Aligned Sentence Pairs

This paper proposes a method to identify zero pronouns within a Japanese sentence and their antecedent equivalents within the corresponding English sentence from aligned sentence pairs. The method focuses on the characteristics of Japanese and English, two languages from different families and in which the distribution of zero pronouns is very different. In this method, the Japanese sentence an...

A Probabilistic Method for Analyzing Japanese Anaphora Integrating Zero Pronoun Detection and Resolution

This paper proposes a method to analyze Japanese anaphora, in which zero pronouns (omitted obligatory cases) are used to refer to preceding entities (antecedents). Unlike the case of general coreference resolution, zero pronouns have to be detected prior to resolution because they are not expressed in discourse. Our method integrates two probability parameters to perform zero pronoun detection ...

Identification of Zero Pronouns and their Antecedents within Aligned Sentence Pairs

This paper proposes a method to identify zero pronouns within a Japanese sentence and their antecedent equivalents within the corresponding English sentence from aligned sentence pairs. The method focuses on the characteristics of Japanese and English, two languages from different families and in which the distribution of zero pronouns is very different. In this method, the Japanese sentence an...

Publication date: 2015